Methodological confounds of measuring urinary oxidative stress in wild animals

Abstract Biomarkers of oxidative stress (OS) are useful in addressing a wide range of research questions, but thus far, they have had limited application to wild mammal populations due to a reliance on blood or tissue sampling. A shift toward non‐invasive measurement of OS would allow field ecologists and conservationists to apply this method more readily. However, the impact of methodological confounds on urinary OS measurement under field conditions has never been explicitly investigated. We combined a cross‐sectional analysis with a field experiment to assess the impact of four potential methodological confounds on OS measurements: (1) time of sampling, (2) environmental contamination from foliage; (3) delay between sample collection and flash‐freezing in liquid nitrogen; and (4) sample storage of up to 15 months below −80°C. We measured DNA oxidative damage (8‐hydroxy‐2′‐deoxyguanosine, 8‐OHdG), lipid peroxidation (malondialdehyde, MDA), total antioxidant capacity (TAC), and uric acid (UA) in 167 urine samples collected from wild Zanzibar red colobus (Piliocolobus kirkii). We found that MDA was higher in samples collected in the morning than in the afternoon but there were no diurnal patterns in any of the other markers. Contamination of samples from foliage and length of time frozen at −80°C for up to 15 months did not affect OS marker concentrations. Freezing delay did not affect OS levels cross‐sectionally, but OS values from individual samples showed only moderate‐to‐good consistency and substantial rank‐order reversals when exposed to different freezing delays. We recommend that diurnal patterns of OS markers and the impact of storage time before and after freezing on OS marker concentrations be considered when designing sampling protocols. However, given the high stability we observed for four OS markers subject to a variety of putative methodological confounds, we suggest that urinary OS markers provide a valuable addition to the toolkit of field ecologists and conservationists within reasonable methodological constraints.


| INTRODUC TI ON
Non-invasive techniques have revolutionized the field of ecophysiology by, firstly, reducing the adverse effects of field research on animal welfare and behavior, and, secondly, limiting the confounding effects of the stress of capture and restraint on biomarker measurements Nwunuji et al., 2014). This has allowed us to address new questions in previously inaccessible study systems Narayan, 2013). But while non-invasive methods are commonly applied to study animal energetics (Emery Thompson, 2017), endocrinology (McCormick & Romero, 2017) and, more recently, immune function Tombak et al., 2020), non-invasive studies of oxidative stress physiology in the wild are rare (Thompson González et al., 2020).
Oxidative stress (OS) is caused by the imbalance between the production of reactive oxygen species and the production, dietary intake, and repairing action of antioxidants. Oxidative stress can damage cellular DNA, proteins, and lipids (Valko et al., 2007). This can have negative consequences for health, reproduction, and survival (Bize et al., 2008;Saino et al., 2011;Sebastiano et al., 2017) making OS markers highly relevant to individual fitness. Additionally, oxidative stress is universal to aerobic organisms, meaning that OS can be measured in a wide range of taxa (Beaulieu & Costantini, 2014). Uniquely among physiological markers, both the damage and protection aspects of OS can be measured separately, meaning that OS markers can give us unparalleled insights into both the costs facing an organism and an organism's ability to cope with these costs (Beaulieu & Costantini, 2014). These properties of OS markers make them particularly suitable tools for the study of life-history trade-offs Monaghan et al., 2009;Speakman et al., 2015;Thompson González et al., 2020) and the impacts of anthropogenic disturbance (Semeniuk et al., 2009). Indeed, many studies have investigated these topics in wild animals but have relied exclusively on blood or tissue samples to quantify OS (e.g., Ovis aries: Nussey et al., 2009;Christensen et al., 2016; Mungos mungo: Vitikainen et al., 2016;Macaca mulatta: Georgiev, Muehlenbein, et al., 2015;Georgiev, Emery Thompson, et al., 2015;Dasyatis americana: Semeniuk et al., 2009;and Mandrillus sphinx: Beaulieu et al., 2014). The reliance on invasive sampling has thus precluded the broader adoption of OS markers in field research, especially in the case of large, endangered, and difficult-to-capture animals.
A shift to a non-invasive approach for quantifying OS will allow us to study a wider range of animals, address research questions requiring longitudinal measurements, and reduce the risk posed to study species. Urinary OS markers provide a promising alternative to blood and tissue sampling for field studies. Markers of OS are more stable in urine than in blood (Il'yasova et al., 2012) and have been used in clinical research since at least the 1980s (Cathcart et al., 1984). More recently, urinary OS markers have been applied in studies of captive animals in laboratory and zoo settings (Cho et al., 2009;Marchal et al., 2013;Costantini et al., 2021,) but thus far, only one study has measured OS in wild animals via non-invasive sampling (Thompson González et al., 2020). In their study of wild chimpanzees, Thompson González et al. (2020) showed that MDA-TBARS (a marker of lipid peroxidation) was higher and total antioxidant capacity was lower later in the day and they found a weak negative relationship between storage time and MDA-TBARS. However, a more explicit investigation of a broader range of methodological confounds in measuring OS in the field is essential to aid the planning of robust field research using OS markers in the future.
In this study we, therefore, aimed to explicitly examine four major confounds that can affect the interpretation of OS values in opportunistically collected urine samples in a remote field setting: (1) time of sampling; (2) environmental contamination from foliage during sample collection; (3) a delay between sample collection and flash-freezing in liquid nitrogen; and (4) prolonged sample storage at or below −80°C.
First, any circadian variation in marker excretion can be problematic when relying on imbalanced datasets that are characteristic of opportunistic sampling. Evidence of diurnal variation in OS is mixed and differs between markers and sample media, both in humans (Alajbeg et al., 2017;Singh et al., 2004;Valencia et al., 2001) and chimpanzees (Thompson González et al., 2020). Therefore, an assessment of diurnal variation in multiple OS markers in an additional species, in particular a non-ape species, will provide a valuable addition to this literature and aid in determining the potential impact of imbalanced sampling on the analysis of OS markers.
Second, we also examined the effect of possible environmental contamination on OS marker levels. Evidence for significant and consistent impacts of environmental contaminants on urinary biomarkers is mixed, therefore a marker-specific assessment of such confounds is usually necessary (Braga Goncalves et al., 2016;Heistermann & Higham, 2015;Higham et al., 2020;Schwartz & Granger, 2004). Environmental contamination of samples can be introduced through a variety of ways, for example, mixing with soil, feces, or contact with vegetation. Contamination of samples through contact with vegetation is one of the most difficult sources of contamination to avoid because the collection of urine directly from the stream of a wild animal is not always possible and samples are regularly collected from the surface of vegetation (Fedurek et al., 2016;Higham et al., 2011;Rincon et al., 2019;Surbeck et al., 2012;Thompson González et al., 2020). Although the potential effects of leaf-borne contaminants on urinary biomarker measurement have been successfully ruled out for some metabolites (testosterone and creatinine: Muller & Wrangham, 2004;Marshall & Hohmann, 2005;

T A X O N O M Y C L A S S I F I C A T I O N
Ecophysiology and estrone conjugates: Knott, 2005), this is yet to be confirmed for OS markers.
Third, we assessed the effect of a delay between sample collection and flash-freezing in liquid nitrogen on OS measurements, something which is often unavoidable and difficult to standardize at remote field sites. Markers of oxidative status are generally considered stable during short-term storage before freezing (blood: at 0-4°C for 40 h and at 21-22°C for 40 h, Koracevic et al., 2001;4°C for up to 24 h, Nussey et al., 2009;3 h to 48 h at 4°C and 20°C, Jansen et al., 2013a;urine: 20°C for 26 h, Lee & Kang, 2008;4°C and 25°C for 24 h, Matsumoto et al., 2008). While these results are promising, whether the same degree of urinary marker stability would be retained at the higher ambient temperatures often found in field conditions in the tropics requires evaluation.
Fourth, we also considered how duration of frozen storage affects OS markers. Field studies of wild animals are often conducted in remote locations with limited access to laboratory equipment meaning samples are often stored for months or years before analysis. Additionally, length of time in storage is a particularly difficult confound to standardize because samples are normally collected over a long period of time and are assayed in the lab in one or several batches. Therefore, ensuring stability of OS markers during storage is necessary to reliably compare samples. OS markers in blood have been found to be stable in long-term storage after freezing for up to 2 years (−20°C for 1 month, Koracevic et al., 2001 We investigated the effect of these four putative methodological confounds on OS marker levels in the Zanzibar red colobus (Piliocolobus kirkii), an endangered primate for which no physiological data have been reported either from the wild or from captivity.
The redox status of an organism is the result of a complex cascade of processes and therefore it is necessary to measure multiple markers representing different aspects of these processes to properly capture the OS an animal is facing (Speakman et al., 2015). We chose four complementary OS markers representing different aspects of the oxidative status of the animal; two markers of oxidative damage: 8-hydroxy-2′-deoxyguanosine (8-OHdG), a marker of DNA oxidative damage and malondialdehyde (MDA), a marker of lipid peroxidation, and two markers of antioxidant capacity: total antioxidant capacity (TAC) and uric acid (UA). These markers are known to be stable, represent system-wide levels of OS, and have a variety of commercial assays available to test for them. Because of this, these markers have been used to measure OS in a variety of contexts, for example, in studies of wildlife conservation (French et al., 2017), life history (Christensen et al., 2016;Thompson González et al., 2020), behavioral ecology Georgiev, Emery Thompson, et al., 2015;Georgiev, Muehlenbein, et al., 2015), and socioecology . Therefore, our choice of markers represents a useful marker set for ecologists and conservationists.
We examined how the concentrations of these four markers were affected by (1)

| Study site and subjects
We sampled 40 wild Zanzibar red colobus (5 adult males, 35 adult or subadult females) from three groups in and around the edges of Jozani-Chwaka Bay National Park, Zanzibar (6.233°S, 39.404°E).
The Zanzibar red colobus is endemic to the island of Unguja where there are ca. 6000 individuals, 50% of which are found at this national park (Davenport et al., 2019). The subjects of this study are exposed to high levels of habitat disturbance and human activity from roads, tourism, and nearby villages and farms Olgun et al., 2021;Siex & Struhsaker, 1999). They do not receive provisioned food. The mean maximum and minimum temperature between February 2019 and February 2020 at Jozani-Chwaka Bay National Park were 34.5°C and 21.7°C, respectively (Zanzibar Red Colobus Project, unpublished data). The mean daily temperature variation was 12.7°C.

| Urine sampling and storage
We opportunistically collected 225 urine samples from the 40 colobus (mean 6.5 samples per individual, range 1-29) typically between 7:00 and 18:00 h over a period of 12 months (August 2018-September 2019). We collected samples immediately after excretion from identified individuals which could be distinguished using facial markings and other distinguishing features (e.g., scars, injuries, and posture/shape). We either caught urine midstream using a plastic bag on the end of a catchpole or pipetted fresh urine splatter from the leaves. We only collected samples, which were not visibly contaminated with feces or detritus. Samples were carried in the dark and on ice in insulated lunchboxes until flash-freezing in liquid nitrogen was possible later the same day. The samples were then transported to the UK in dry shippers below −150°C. Once at the laboratory, all samples were stored in −80°C freezers until assaying.

| Freezing delay experiment
To examine the effect of varying delays to flash-freezing of urine samples on OS marker measurement, we conducted a field experiment with seven urine samples. Upon collection of the sample, we briefly mixed and aliquoted each sample into four tubes and stored them on ice in an insulated lunchbox as described above. Upon return to the field base, we flash froze the first sample in liquid ni-

| Oxidative stress marker analysis
We measured four markers of OS in all urine samples: a marker of DNA oxidative damage (8-OHdG), a marker of lipid oxidative damage (MDA), and two markers of antioxidant capacity (TAC and UA).

8-OHdG concentration was measured using the Invitrogen DNA
Damage Competitive ELISA kit (catalogue number: EIADNAD). The concentration of MDA was measured using high-performance liquid chromatography (HPLC) with no sample dilution. The concentration of TAC was measured using the Cayman Chemical Antioxidant Assay kit (catalog number: 709001). The concentration of UA was measured using the Cayman Chemical Uric Acid Assay kit (catalog number: 700320). All assays were carried out as per the manufacturer's instructions and to obtain values within the sensitivity range of the assay, we diluted samples 1:150 for 8-OHdG, 1:100 for TAC, and 1:200 or 1:400 for UA. We adjusted the concentrations of all markers for urine dilution using specific gravity measured in the undiluted samples (Anestis et al., 2009). For 8-OHdG, TAC, and UA, we assayed each sample in duplicate within the same plate and we repeated two samples across all plates as inter-assay controls. The intra-assay coefficients of variation (CVs) were 9.8% for 8-OHdG, 7.2% for TAC, and 4.5% for UA. The inter-assay coefficients of variation (CVs) were 11.7% for 8-OHdG, 24.7% for TAC, and 14.8% for UA. Due to the high inter-assay CV for TAC, we included plate as a random effect in the mixed model. To estimate the repeatability of measurement of HPLC, we analyzed 20 samples in duplicate which had an average CV of 4.22%.
We removed all samples for which specific gravity could not be measured due to values falling outside the detection range of the specific gravity meter (range 1-1.05) (N = 56). Two samples were removed because they had a low specific gravity (<1.004), which was leading to inflated marker concentrations (Thompson González et al., 2020). We removed 43 8-OHdG measurements, 1 MDA measurement, 11 TAC measurements, and 1 UA measurement due to having a CV > 15%. Some samples were assayed for some markers and not others due to small sample volume and budgetary constraints.

| Statistical analyses
2.5.1 | Cross-sectional analysis: Testing the effects of environmental contamination, time of day, and duration of sample storage before and after freezing on OS marker measurement We used a systematic model selection method in which we constructed a set of candidate GLM models to investigate the impact of the four methodological confounds [method of collection (two levels: plastic (n = 135) and leaves (n = 32)), time of day (decimal hours past midnight), freezing delay (decimal hours), and timefrozen (decimal weeks)] on each OS marker. Because we did not have a priori predictions about the effects, each set of candidate models consisted of all combinations of the covariates and a null model containing only the random effects. Collection method was not included in the TAC and UA models because all assayed samples were collected on plastic. In all models, individual monkey ID was included as a random effect to account for multiple sampling of individuals. Plate was also included as a random effect in the TAC models due to these assays having high inter-plate CVs. Finally, the number of freeze-thaw cycles was included as a fixed effect in the MDA model because 10 samples had 2 additional freeze-thaw cycles. This was included as a fixed effect instead of a random effect because it had only two levels. The number of freeze-thaw cycles of samples assayed for the remaining three markers was the same so it was not necessary to account for it statistically. For a full list of model structures, please see Table A1. The models were constructed using the lme4 package in R (Bates et al., 2015). We checked model residuals for normality and homogeneity by visual inspection of qqplots and scatterplots of fitted values versus standardized residuals, respectively.
We log-transformed 8-OHdG, MDA, and UA measurements to homogenize and normalize the residuals. Collinearity was not an issue in these models (variance inflation factors <3.0; car package, Fox & Weisberg, 2019). The candidate models in each set were ranked based on AIC c (Akaike's information criterion corrected for small sample size bias) to select the most parsimonious model with the lowest AIC c value and highest AIC c model weight. In each set of candidate models, more than one model had support (Δ[Q ] AICc <2) and therefore we carried out multi-model inference using model averaging with shrinkage in the AICcmodavg package in R (Mazerolle, 2020). Model averaging with shrinkage calculates weighted averages of the estimates based on all candidate models whereby models not containing the variable of interest are assigned a value of 0 for the β and variance. This is considered more robust method of model averaging than only averaging the models containing the variable of interest (Burnham & Anderson, 2002;Cade, 2015).

| Diurnal variation in OS markers
In addition to examining the effect of time of day on OS marker levels cross-sectionally as described above, we identified matched pairs of samples that were collected in the morning (before 12 p.m., mean = 09:24 a.m.) and the afternoon (after 12 p.m., mean = 2:57 p.m.) from the same individual on the same day (nine pairs). If there were multiple samples in the morning or afternoon from the same individual, we averaged the marker concentrations across these samples. The average difference in collection time between morning and afternoon samples was 5 h and 36 min. We tested for differences between morning and afternoon urinary MDA levels using a Wilcoxon matched-pairs test. This analysis was only carried out for MDA because there were not enough measurements of the other markers to conduct a matched-pairs analysis following the exclusion of samples with an intra-sample CV above 15%. We also calculated the coefficient of variation each for samples collected in the morning and for samples collected in the afternoon across the 12-month dataset for all markers. We compared these CV values to determine whether OS markers are more variable in the morning or afternoon. These analyses were conducted on samples collected on plastic only.

| Field experiment testing the effect of freezing delay on OS marker levels
To investigate how consistent OS measurements were across 2-h freezing delay increments, we calculated Kendall's concordance coefficients for each marker to investigate changes in rank order of samples following different freezing delays and we calculated the intra-class correlation coefficient (ICC) for each marker based on single-rating, absolute agreement, two-way mixed-effects model. These analyses were carried out using DescTools (Signorell et al., 2021) and irr (Gamer et al., 2019), respectively. To interpret the results of the ICC, we used commonly accepted cut-off values for qualitative ratings of agreement where less than 0.5 = poor reliability, 0.5-0.75 = moderate reliability, 0.75-0.9 = good reliability, and 0.9 and above = excellent reliability (Koo & Li, 2016).
Additionally, for illustrative purposes, we calculated the percentage change in OS concentration from time 0 for each freezing delay interval for each sample. All these samples were collected on plastic. This analysis was only conducted for MDA and 8-OHdG due to budgetary constraints. After excluding samples as described above, we analyzed seven sets of aliquots for MDA and four for 8-OHdG because three of the sets of 8-OHdG aliquots were incomplete.
All analyses were carried out using R Studio (RStudio Team, 2020).

| Cross-sectional analysis: Testing the effects of environmental contamination, time of day, and duration of sample storage before and after freezing on OS marker measurement
Model selection identified 13, 11, 7, and 7 plausible models (Δ[Q] AICc <2) for 8-OHdG, MDA, TAC, and UA, respectively (Appendix 1). Multi-model averaging with shrinkage showed OS marker concentration was not affected by any of the investigated methodological confounds ( Table 1).

| Diurnal changes in OS
MDA concentrations were higher in samples collected in the morning than in the afternoon (N = 9, V = 41, p-value = .02734, Figure 1).
Similarly, the mean CV across freezing delays was 18.5% for 8-OHdG  Table 2). Despite having moderate-to-good reliability between freezing delay steps, there was frequent rank-order changes between the freezing delay steps (Figure 2b).

| DISCUSS ION
In this study, we demonstrate that OS markers are generally robust to methodological confounds that are common in field research.
However, the reliability of some OS estimates may be affected by the duration of sample storage before freezing and the time of day the sample was collected. We highlight some potential confounds that future studies should consider and provide practical recommendations for the measurement of OS in urine collected non-invasively and opportunistically from wild animals. Finally, we discuss the potential for use of urinary OS markers in field research.

| Diurnal variation
While 8-OHdG, TAC, and UA did not exhibit a pronounced diurnal variation in Zanzibar red colobus urine samples, MDA was higher in the morning than in the afternoon. This result was only apparent in the matched-pairs analysis and not in the cross-sectional analysis. Following the removal of samples with intra-sample CVs above 15%, we did not have enough OS concentration measurements to conduct a matched-pairs test for 8-OHdG, TAC, and UA. Therefore, diurnal patterns could be present in these three markers if a matched-pairs analysis were conducted in the future. urine: Homo sapiens, Kanabrocki et al., 2002;Miwa et al., 2004; and Pan troglodytes, Thompson González et al., 2020). One could argue that these conflicting conclusions may be due to the analytical procedures used. Where the analysis has been conducted in urine, variation in urine concentration could mask true diurnal patterns in OS production because urine concentration varies widely throughout the day. Some studies adjust OS measurements for urine concentration using specific gravity or creatinine Controls were frozen as soon as possible after collection and then aliquots were frozen following 2-, 4-, and 6-h delays. Sample numbers 1, 2, and 5 are missing in for 8-OHdG due to failed measurements in some aliquots (Thompson González et al., 2020) while others do not (Kanabrocki et al., 2002). We adjusted for urine concentration using specific gravity in this study, but still found conflicting results to those

| Environmental contamination
We found that contact with leaves had no effect on OS measurements for any of the markers we studied. This lack of effect of leaf contamination is similar to that which has been observed in urinary steroid measurements (Knott, 2005;Marshall & Hohmann, 2005;Muller & Wrangham, 2004). This result suggests that urinary OS biomarkers are sufficiently stable to be collected from either plastic or leaf surfaces, which would give flexibility in sample collection method to field biologists that may not always succeed in placing a plastic sheet underneath their study subjects with full precision and perfect timing. We would, however, recommend additional testing of the impact of collection method on urinary OS marker concentration using an experimental approach following Muller and Wrangham (2004) and Knott (2005).

| Storage time before freezing
Our cross-sectional analysis showed that freezing delay up to 520 min did not have a significant effect on any of the four OS markers that we measured. Our study agrees with previous field research cross-sectionally testing the effect of freezing delay on blood OS marker concentrations Nussey et al., 2009).
However, our experimental analysis of MDA and 8-OHdG measurements showed that values of these markers were highly variable and inconsistent across freezing delays of up to 360 min, more than has been observed in clinical OS studies (Lee & Kang, 2008;Matsumoto et al., 2008) and other urinary biomarkers (e.g., urinary neopterin: Heistermann & Higham, 2015). This discrepancy might be due to a difference in ambient temperature at our tropical field site and in the lab conditions used in these studies. The difference in results between our cross-sectional and experimental study suggests that freezing delay may not be an issue for higher-level comparisons of OS between groups of samples, but that comparisons between individual samples exposed to different freezing delays may not be informative. An investigation of the rank orders of individual samples is missing from the previous studies of freezing delay and urinary OS measures (Lee & Kang, 2008;Matsumoto et al., 2008) and therefore these fine-scale differences between samples may have been overlooked. Future research should consider the potential confounding effects of storage on OS measurements, especially if individual samples are to be compared.
We found that for both 8-OHdG and MDA, OS concentration oscillated across freezing delay steps where we would expect a steady decrease. Even though samples were mixed using a pipette prior to aliquoting, we suspect this oscillation might be caused by drawing the four aliquots from a heterogenous urine sample, rather than due to lab measurement error, as that was low (CV <15%). Therefore, there may be additional confounding factors affecting the reliable comparison of OS measurements from individual samples and we advise that spot sampling (whereby single samples are taken to represent the concentration for a unit of analysis, e.g., individuals, sites, and periods) should be avoided. Instead, multiple samples should be taken to calculate mean OS concentrations for each unit of analysis.
MDA was more stable across the four freezing delay steps than 8-OHdG. In fact, using a well-accepted CV cut-off value of 15%, most of the 8-OHdG measurements would be deemed unreliable if the aliquots were treated as measurement replicates. There is no indication in the literature that 8-OHdG is a less stable molecule than MDA (Cooke et al., 2008) so the difference in variability between the markers most likely comes from the methods we used to quantify them (ELISA for 8-OHdG and HPLC for MDA). While ELISAs have been widely used to measure 8-OHdG in urine samples, they are subject to greater measurement variability (Barregard et al., 2013) than chromatographic approaches such as HPLC (Graille et al., 2020). An ELISA is adequate for comparing relative urinary OS levels between groups (where multiple samples contribute to a mean estimate) but not to reliably measure exact concentrations of markers (Cooke et al., 2008;Yoshida et al., 2002). Researchers should be aware of the analytical limitations of different methods and select the most appropriate method to address their question.
For example, if individual samples need to be compared, HPLC approaches, which have higher specificity and sensitivity, would be more desirable.  Jansen et al., 2013b;Costantini et al., 2017;Jansen et al., 2017;Rubio et al., 2018;urine: Matsumoto et al., 2008;Martinez & Kannan, 2018;Thompson González et al., 2020). However, our result is promising for future studies utilizing OS markers, both in field and laboratory environments.

| CON CLUS IONS
The application of non-invasive OS measurement in the field will strengthen our ability to address exciting theoretical and applied questions, for example, the role of OS in life-history trade-offs, development, aging, reproduction, and the effects of anthropogenic disturbance and environmental conditions on wild animal physiology, health, and fitness. The redox system is highly conserved across taxa and plays an important role in many biological processes. Therefore, it can be studied in a wide variety of contexts and study systems, something that will only be aided by being able to study natural populations in a non-invasive way. Additionally, being able to investigate these questions in populations engaging in natural social interactions and facing natural resource restrictions will make the insights more pertinent than those that might be gained through laboratory studies.
However, applying physiological methods in the field increases the potential for methodological confounds (e.g., from environmental contamination, differences in freezing delays, heteroge-

ACK N OWLED G M ENTS
We thank the Department of Forestry and Non-Renewable Natural

CO N FLI C T O F I NTE R E S T
All authors gave final approval for publication and have no conflicts of interest.  AICc of each model and the top-ranked model, and AICcWt is the AIC weight of each model. Estimates and SE are set to 0 for models that do not contain the variable of interest so as not to inflate estimate values. The models in bold represent the 95% confidence set of models.